Missing Covariates : a Bayesian Approach

نویسندگان

  • John C. Naylor
  • Tony Pettit
چکیده

John C. Naylor Tony Pettit y Abstract Data on observed fault rates in an industrial process together with values of selected covariates, ordered through time, is analysed. The record of covariate values is incomplete for operational reasons and a particular feature of the analysis is dealing with these missing values. Of practical interest is the selection of covariates which might in uence the fault rate, with respect to possible future monitoring or control of the process. 1 The Problem The data to be modelled and analysed is observed defect rates for castings of one type produced at The Southern Cross Foundry during a period of a few weeks in 1989. Of the three types of defect noted only one (`blisters') will be considered as this is, for this casting type, the most serious possible defect. The data is recorded for several days on which this particular product was cast, includes values for covariates relating to the mold sand used, together with a record of the time and date of production. For a fuller description of the industrial situation and of possible covariates and their measurement see [3, 4]. The occurrence of defects has been recorded only as a percentage rate, 100 p say. The maximum run size for this type of casting is 250 but shorter runs are possible for a variety of reasons and it is likely that there is considerable variation in sample size for p. It is not then possible to consider models for p based on, for example, the binomial distribution. In any case, exploratory analysis suggests the variation in observed values for p is much greater than would be expected for a simple binomial model with Deparment of Mathematics, Statistics and O.R., Nottingham Trent University, Nottingham, UK. yHead of Department of Mathematics, Queensland University of Technology, Brisbane, Queensland, Australia. o o oo o

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

مقایسه روش بیزی (Bayesian) و کلاسیک در برآرد پارامترهای مدل رگرسیون لجستیک با وجود مقادیر گمشده در متغیرهای کمکی

Background and Aim: Logistic regression is an analytic tool widely used in medical and epidemiologic research. In many studies, we face data sets in which some of the data are not recorded. A simple way to deal with such "missing data" is to simply ignore the subjects with missing observations, and perform the analysis on cases for which complete data are available. Materials and Methods: We c...

متن کامل

A BAYESIAN APPROACH TO COMPUTING MISSING REGRESSOR VALUES

In this article, Lindley's measure of average information is used to measure the information contained in incomplete observations on the vector of unknown regression coefficients [9]. This measure of information may be used to compute the missing regressor values.

متن کامل

Semiparametric Bayesian Analysis of Matched Case-Control Studies With Missing Exposure

This article considers Bayesian analysis of matched case-control problems when one of the covariates is partially missing. Within the likelihood context, the standard approach to this problem is to posit a fully parametric model among the controls for the partially missing covariate as a function of the covariates in the model and the variables making up the strata. Sometimes the strata effects...

متن کامل

A pseudo-Bayesian shrinkage approach to regression with missing covariates.

We consider the linear regression of outcome Y on regressors W and Z with some values of W missing, when our main interest is the effect of Z on Y, controlling for W. Three common approaches to regression with missing covariates are (i) complete-case analysis (CC), which discards the incomplete cases, and (ii) ignorable likelihood methods, which base inference on the likelihood based on the obs...

متن کامل

A Bayesian analysis of mixture structural equation models with non-ignorable missing responses and covariates.

In behavioral, biomedical, and social-psychological sciences, it is common to encounter latent variables and heterogeneous data. Mixture structural equation models (SEMs) are very useful methods to analyze these kinds of data. Moreover, the presence of missing data, including both missing responses and missing covariates, is an important issue in practical research. However, limited work has be...

متن کامل

A Bayesian Semiparametric Model for Case - ControlStudies with Errors

We develop a model and a numerical estimation scheme for a Bayesian approach to inference in case-control studies with errors in covariables. The model proposed in this paper is based on a nonparametric model for the unknown joint distribution for the missing data, the observed covariates and the proxy. This nonparametric distribution deenes the measurement error component of the model which re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007